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Abstract 

We present a theoretical as well as experimental in- 
vestigation of a population of self-replicating seg- 
ments of code subject to random mutation and sur- 
vival of the fittest. Under the assumption that such 
a system constitutes a minimal system with charac- 
teristics of life, we obtain a number of statements 
on the evolution of complexity and the trade-off be- 
tween entropy and information. 

1 Introduction 



O 

Owhat are the defining characteristics of life? For biolo- 
J^gists and those studying living systems, this is the ques- 
^-ption that predates other questions and generates passion- 
Sate arguments; for there are several ways how to answer 

this question. On the one hand, there is the philosophi- 

&©al approach. We would invent a "Turing Test of Life" in 
Qthe tradition of the empiricists: "Anything that appears 
I to be alive, is alive". On the other hand, we may draw 
OhP a list of characteristics and decide the question case 
C3by case, through a look-up table of sorts. To the scien- 
' ytist interested in the foundations of life, however, neither 



'of these approaches is satisfactory. Rather, the scientific 
method applied to the problem of obtaining the defining 
characteristics of life would demand that we construct a 
model of life, one that displays the necessary attributes 
to pass the "Turing Test of Life" , and then strip it down 
to the most simple system that still displays these char- 
acteristics. Anything that, if removed from this most 
simple system would result in a drastic change, remov- 
ing the life-likeness, must be considered as a defining 
characteristic of this system. This does not preclude the 
existence of other systems passing the same test, with 
different characteristics. It does however establish the 
model under consideration as a baseline, and its charac- 
teristics as universal in its class. It is after this step has 
been taken that we can further enrich the model to learn 
more about life. 

Clearly, this kind of approach is impossible using nat- 
ural systems, as all life currently extant shows consid- 
erably more sophistication than needed just to survive. 
Thus, the construction of the minimally living system 
must be artificial. This has recently become possible 
with the advent of Tom Ray's tierra [1-3], and systems 



based on his idea. Briefly, tierra-like, (or "auto-adaptive 
genetic" [4]) systems consist of a population of self- 
replicating strings of information coded in a suitable 
manner that "live" , and coevolve in an environment sub- 
ject to mutation. The information contained in the self- 
replicating strings is the information necessary to carry 
out the task of self-replication, as well as any informa- 
tion that might give a certain genotype an advantage 
over another. Typically, this is information about the 
population itself, and the environment it lives in. Such 
a system displays a wealth of complexity and "life-like" 
characteristics [1-6]; enough to suggest answers to a num- 
ber of fascinating questions relating to life. 

Our approach then will be this: We assume that such 
systems of self-replicating strings do constitute a "mini- 
mal living system" , and if not minimal then close enough 
to support the notion that the results obtained using the 
model are reflections of its "aliveness" , rather than ar- 
tifacts. While the theoretical results are independent of 
a particular implementation, the experimental ones are 
not. It is beyond the scope of this article to examine 
how close the particular system is to the abstract mini- 
mal living system, yet our results suggest that they are 
a good approximations, and in many respects univer- 
sal. In the following we present theoretical arguments 
and experiments supporting them that suggest answers 
to a number of fascinating problems, including the emer- 
gence of complexity in living systems, and the nature of 
the evolutionary process. 

Let us attempt to isolate the main characteristics of 
the "minimal living system" . Primarily, we find that the 
cornerstone of this system is self-replication. The sta- 
tistical mechanics of the ensemble of strings with and 
without self-replication are so vastly different (see Sec- 
tion 2) that we can determine with confidence to have 
isolated a defining characteristic of living systems. While 
this in itself is not surprising, it is specifically the self- 
replication of the information contained in the genome 
that is necessary (and that in this simple system is tan- 
tamount to replication of the entity). The consequences 
of this fact alone are manifold. Considering the evidence 
for genetic similarity in tRNA from the time of the origin 
of life to today [7], this capability seems to allow for the 
conservation of information over infinite periods of time 
in a noisy environment. 

The keys for the evolution of complexity in this system 



are: bit-wise mutation acting on the genome of mem- 
bers of the population, and survival of the fittest. The 
former is again an obvious choice for a necessary condi- 
tion: Without mutation, a population of self-replicating 
strings shows no patterns, no evolution, and no com- 
plexity. Clearly, mutation works against the force that 
attempts to preserve information, yet at the same time 
mutation creates information, or rather transfers it from 
the environment into the genome of the strings, where 
again it will be preserved by self-replication. We find 
that "survival of the fittest" is synonymous with sur- 
vival of the most populous. Thus, fitness in this system 
is directly related to replication rate. In the absence of a 
mechanism that enables one string to prevent the repli- 
cation of another (kill-strategy), it is conceivable that 
"fitness = replication-rate" is a universal feature of such 
low-level organisms. 

Another mystery of living systems is associated with 
the evolution of complexity and its apparent contradic- 
tion with the second law of thermodynamics, the law of 
increasing entropy. While it is known that dissipative 
systems involving microscopic irreversibility can evolve 
from disorder to order, the precise mechanism is often 
unclear. In this model, the evolution of complexity ap- 
pears to proceed by a trade-off between entropy and in- 
formation. Information is transferred from the environ- 
ment into the genome of the population via chance mu- 
tations that allow the mutated string to exploit the infor- 
mation for a higher replication rate. As a consequence, 
the (information-theoretic) entropy of the population of 
strings drops by a commensurate amount until off-spring 
of the mutated string have reestablished disorder and 
the previous level of entropy is (roughly) restored. The 
time between such events is distributed according to a 
power law, and we thus observe periods of stasis inter- 
rupted by "avalanches of invention" . Again, if we accept 
this system as a model for the most primitive and ba- 
sic living system, the evidence suggests that evolution 
is Darwininan in nature even though not gradual, but 
rather punctuated: the picture promoted by Gould and 
Eldredge [8]. Below, we shall give more details and re- 
sults of experiments that support the claims made here. 

2 Statistical Mechanics of 

Self-Replicating Bit-Strings 

Let us approach the model system from a theoretical 
point of view and write down a few of the basic equations 
governing the dynamics of self-replicating strings. Con- 
sider a system of N strings of £ instructions, where each 
instruction can take on p values (p = 2 for binary strings, 
p = 4 for DNA, and p = 32 for Ray's tierra). Also, let 
there be N g different genotypes (different combinations 
of instructions) extant in the population, N g < N , and 
let rii denote the number of strings of genotype i. Then, 



N 



(i) 



Let us further assume that the total number of strings in 
the population is constant: dN/dt = (this constraint 



can be relaxed trivially). Also, denote the replication 
rate (off-spring per unit time) of genotype i by S{. Defin- 
ing the average replication rate (average "fitness") 



Pi£i 



(2) 



where pi is the genotype density of species i, we can 
immediately write down the equation that describes the 
time-evolution of n 8 - in a "mean-field" approximation 

N 

m(t + 1) - m(t) = (si-{s}-Rl) m(t) + —M 



limit) + C 



(3) 



with obvious definitions for ji and C, the "source" 
term. Here, we introduced the mutation rate R in units 
[mutations-(site • time) -1 ]. Thus, Rl is the probability 
that a string of length I is hit by a mutation per unit 
time, i.e. per instruction executed. The constant term 
on the right hand side of (3) is modelling mutations act- 
ing on genotypes j ^ i and ensures N = 0. Equation (3) 
can be solved in the equilibrium limit, de/dt = 0: 



rii(t) 



Rl 
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(4) 



using an equilibrium boundary condition, n;(0) 
N I N g = n. For C ~ this is simply 



m(t) = n;(0)e 



(5) 



where ji = e% — e — Rl is the "growth factor" . Clearly, in 
equilibrium ji = only for the best genotype and ji < 
for inferior ones. 

In order to make contact with statistical mechanics, 
define the inferiority 



— £best &i 



(6) 



where £best is the highest replication rate in the popula- 
tion. Then, E{ > and we can define the ground-state 
("vacuum") of the population by 

(0|£|0) = E = . (7) 

Let us also define the "fine-grained" (Shannon-) entropy 



S 



with the property 



dS_ 

dE 



1 

Rl 



(8) 



(9) 



which suggests that the system of self-replicating strings 
is analogous to atoms with eigen-energies E{ in a heat 
bath of temperature Rl which initiates transitions E{ —> 
Ej via mutations. In fact, detailed balance arguments 
lead to the equation determining the equilibrium distri- 
bution rii(Ei, E) 



1 



1 



(10) 



with the solution 

ni(Ei,E) 



C 



Ei-E 



Rl 



(11) 



In equilibrium, we find E —> Rl, universally. However, 
this equilibrium is disrupted by spontaneous phase tran- 
sitions triggered by mutations creating genotypes with 
£i > Sbest with a "latent heat" L = As = s^™ - £^ e d st . 
After such an event the entropy of the system defined 
by (2) drops, and then rebounds. In Fig. 1 we com- 
pare the entropy of a system of strings subject to mu- 
tation with or without replication in the avida environ- 
ment [6] 1 . If we start both systems in the uniform state 
E = (which implies S = 0: all strings are identical), we 
see that the self-replicating population reaches a plateau 
with S < log(A) while the non-self-replicating system 
reaches the maximum value rather quickly. In addition, 
the population of self-replicating strings can lower the 
entropy spontaneously, a feat that is impossible for the 
non-replicating population which must strictly adhere to 
the second law of thermodynamics. 
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FIG. 1 Entropy of a population of 400 self- 
replicating (solid line) and non-replicating (dashed 
line) strings started in the uniform state. 

3 Entropy and Information 

Let us investigate more closely the time-evolution of en- 
tropy in systems of self-replicating entities. From the 
point of view of statistical mechanics rather than in- 
formation theory, entropy is just the logarithm of the 
number of states in the volume of "phase-space" oc- 
cupied by the population. Genetic "phase-space", or 
genotype-space, is spanned by the combinatorics to as- 
semble the basic building blocks (or bits), and is finite 
for finite length strings. The population forms a "cloud" 
in genotype-space that moves slowly according to the 
fitness-landscape. Each genotype moves along a trajec- 
tory determined by a gradient term (adaptive process) 
and a noise term (mutations). As a consequence, the 



J We chose this system because the number of strings can 
be kept strictly constant in avida. 



time-evolution of the genotype of a string is determined 
by the usual Langevin equation. While genotype-space 
is very high-dimensional, the number of states occupied 

and thus, 



by the population is by definition N g , 

s = io g (Ag 



(12) 



which is equivalent to the fine-grained entropy in the 
large N limit. The number of genotypes N g is an equi- 
librium quantity, depending on mutation rate Rl, total 
number of strings N , and most importantly, the average 
inferiority of the population. Clearly, only in infinite sys- 
tems after infinite time is \og(N g ) = l\og(p). For finite 
systems, the number of genotypes is determined by the 
rate of birth of new genotypes and the rate of extinction 
of existing ones. The rate of birth of new, viable, geno- 
types is determined by the ratio of "cold" to "hot" spots 
in the genome. A cold spot on the string is an instruction 
that, if mutated, leads to either death or severe inferi- 
ority. Hot spots are instructions that are neutral under 
mutation and are thus responsible for the e-degeneracy 
of genotypes. Thus, if there were no hot spots in the 
genome, there could not be any diversity, and in fact no 
evolution could take place, as was conjectured some time 
ago [9]. 

Let us denote the number of cold spots in a string 
of length I by v c . Then the birth rate of new geno- 
types is RIN , while the birth rate of viable genotypes 
is (1 — v c j '1)RIN . On the other hand, the extinction 
rate of existing (viable) genotypes depends (again in a 
mean-field approximation) on the average inferiority of 
the population, E. Clearly, for zero inferiority there will 
be no extinctions, while extinctions are maximal during 
a phase transition, when the average inferiority is high. 
We thus find 



N g = NRl{\ 
and thus in equilibrium 
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We are now ready to examine the dropping of entropy 
observed through phase transitions. The decrease seen 
in Fig. 1 is of course due to the discovery of information 
and a corresponding new £best- Through such a transi- 
tion, E —> E + As and AS = As/(Rl). However, with 
time E relaxes, and the entropy rebounds, but not nec- 
essarily to its previous equilibrium value. Consider for 
instance the situation where the information gained in 
the process requires more cold spots v c for storage than 
before. Then, a phase transition effectively freezes some 
spots that were originally hot spots, and thus reduces 
the effective length of the string. This is the situation in 
Fig. 1, the entropy rebounds, but reaches a lower plateau 
than before, due to the incorporation of information in 
the genome. We are thus witnessing a curious trade-off 
between information and entropy. In fact, it has been 
conjectured [10] that the physical entropy is not given 
by just the fine-grained entropy, but rather is the sum of 
Shannon entropy S(p) and algorithmic complexity K(p), 
the latter being a measure of information stored in a 



string. In this scenario, complexity can emerge without 
violating the second law, simply by trading information 
for entropy while keeping the physical entropy constant. 



4 Fractal Structure of Evolution 



Earlier we described how macroscopically the system is 
in an equilibrium state only rarely disrupted by chance 
mutations that happen to improve the fitness of the pop- 
ulation. Here, we would like to take a more quantitative 
look at this behaviour, and work out the consequences 
for the structure of evolution. 

More and more evidence is surfacing that certain 
quantities relating to living systems are distributed ac- 
cording to power laws, such as the frequency of extinc- 
tion events of a certain size [11]. It has been known for a 
long time that there is a structure in the taxonomic sys- 
tem, since it was pointed out by Willis [12] that there is a 
very large number of taxa with only one or a few subtaxa, 
but only a few taxa with many subtaxa. Recently, there 
has been an extensive analysis of the frequency distribu- 
tion of the number of taxa N that have n sub-taxa [13], 
with the result that N ~ n~ D with D « 2 ± 0.5. This 
fractal structure of evolution has been obtained from a 
pain-staking analysis of fossil records from the Cambrian 
to the Tertiary (families within orders) as well as a se- 
lection of catalogued flora and fauna (subclades within 
clades, families within orders, and species within gen- 
era). Yet, an explanation for this structure could not be 
offered. Generally speaking, such a distribution can be 
expected if no scale determines the length of time that 
a certain species dominates a population, and the num- 
ber of subfamilies is proportional to this period of time. 
This hypothesis can be tested using the minimal living 
system. 

Recently [5], we suggested that self-replicating systems 
typically operate in a self-organized critical state [14]. 
Drawing the analogy to the sandpile model, we could 
identify information as the agent that causes the self- 
organization, and which is transported via a dissipa- 
tive process through the system. We also identified the 
growth-factor ji [see Eq. (5)] as the critical variable. In 
the equilibrium situation, the population is dominated 
by the current "best" genotype and its e-degenerate off- 
spring, all of which have ji = 0. An advantageous mu- 
tation however can disrupt this equilibrium by creating 
a new "best" genotype with ji > 0. Thus, the vacuum 
with E = is now a false one, and a phase transition 
must occur. This happens in the manner of the sand- 
pile avalanches, with the new information transmitted 
through the system via the off-spring. Gradually, all 
genotypes with a subcritical replication rate will become 
extinct and replaced: the system has returned to its crit- 
ical state. A necessary condition for the identification of 
the self-organized critical state is a power spectrum of 
the fitness history (replication rate as a function of time) 
that is of the power law type. 




1.5 2.0 2.5 3.0 
10 9 Instructions 



FIG. 2 Fitness curve for a typical run. The (nor- 
malized) fitness a of the most successful (i.e., most 
populous) genotype is plotted as a function of time, 
measured every million instructions for a mutation 
rate R = 0.65 x 10" 3 . 

A typical fitness history (normalized replication rate 
of the most abundant genotype in the population as a 
function of evolutionary time) obtained using the tierra 
system is displayed in Fig. 2. 

The associated power spectral density in Fig. 3 shows 
a near perfect l// 2 spectrum over 4 orders of magni- 
tude. While a power-law spectral density is a necessary 
condition for the identification of a self-organized criti- 
cal state, it is not sufficient. Indeed, a l// 2 spectrum 
is typical for a simple random walk, and thus for the 
most basic dissipative process. Incidentally, the process 
of evolution can be described as a guided random walk 
in genotype space, albeit a non-Markovian one. 




FIG. 3 Power spectrum P(f) of a typical fitness 
curve a(t) (Fig. 2). The dashed line is a fit to 
P(f) ~ f~ P with /3 = 2.0 ± 0.05. 

On the other hand, the distribution of waiting times 
between events of a certain size in a random walk model 
is exponential, rather than of the power law type. The 
latter is expected if the time between such events is not 
controlled by any scale. The fundamental time scales 
in self-replicating systems of the sort discussed here are 
the gestation time (the time for a string to gestate one 
off-spring) and the average time between two mutations 
hitting a single string (see Ref. [4]). Both of these time 
scales, however, are much smaller than the lifetime of 
single genotypes. Also, as discussed earlier in connec- 
tion with the heat-bath analogy, the system can be un- 
derstood as a superposition of a very large number of 



meta-stable states, with transitions between them in- 
duced by the mutations. In such a system, the time 
between avalanches or "events" is distributed accord- 
ing to a power law, as is the size and duration of the 
events [14]. Using the tierra system at a mutation rate 
of R = 0.65 x 10 -3 (mutations per instruction executed), 
with a total number of 131,072 sites inhabited by typ- 
ically 600-1,400 strings of length 60-150, we have mea- 
sured the frequency distribution N(t) of waiting times r 
(lengths of "epochs") between phase transitions (which 
are signalled by a discontinuous jump in fitness as in 
Fig. 2). Figure 4 shows the integrated distribution func- 
tion 

M(t) = -J N(t)dt , (15) 

The integrated quantity is distributed with the same ex- 
ponent as N(t) (as can be shown by direct differentia- 
tion), but is more reliable due to improved statistics. 
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FIG. 4 Integrated distribution M(t) of times be- 
tween phase transitions r (length of epoch) mea- 
sured in millions of executed instructions. 

The data were obtained from 50 runs under identical 
conditions (save the random number seed) resulting in 
512 waiting times. The resulting curve was fit to the 
standard power law, cut-off by finite-size effects 2 

M(r) = cr-«exp(-^) (16) 

and obtained a = 1.1 ± 0.05 and T = 450 ± 50, where 
all times are in units of millions of executed instructions. 
We expect the exponent of the power law to be universal, 
while the cut-off of course is not. 

There are a number of fascinating consequences of 
such a distribution. If the time between phase tran- 
sitions, as well as the size of fitness increases during 
such transitions are distributed according to a power law, 
curves such as the one depicted in Fig. 2 are fractal in 
nature, i.e. they (ideally) appear similar at all scales 
("Devil's Staircase" 3 ). If this is the case, we can imag- 
ine that a more sophisticated model, one that can run 

2 Since all runs were terminated after between 500 million 
and 4 billion instructions were executed, we cannot expect 
to measure the distribution of waiting-times larger than 500 
million instructions with satisfactory accuracy. 

3 See, e.g., [15]. 



many orders of magnitudes longer than the one we used, 
with unlimited fitness increases, would yield just such 
a curve, where each element of the curve when exam- 
ined closely would have the same structure as the one 
in Fig. 2. Then, we must admit that fitness increases 
of enormous magnitude can happen almost instantly, 
driven by microscopic point-mutations only 4 , albeit ex- 
tremely infrequently. Also, the model implies that at 
each such phase transition a large number of genotypes 
goes extinct dynamically [see Eq. (14)] without external 
interference (this was also observed in [3]). Thus, Raup's 
distribution of extinction events would equally follow. 

On the other hand, it is tempting to believe that the 
longer a species dominates a population, the more sub- 
species will be generated. As a consequence, the fre- 
quency distribution of taxa with n subtaxa, N(n), must 
show the same critical exponent as the waiting time dis- 
tribution 5 . 

5 Conclusions 

We proposed to model life using artificial living sys- 
tems that are "minimal" in their characteristics. We 
determined self-replication of information, bit- wise mu- 
tation of the information (providing the noise), and an 
environment of information (providing the gradient of 
the fitness-landscape) as necessary characters of such a 
model. The artificial life systems tierra and avida were 
used to determine universal characteristics of living sys- 
tems, such as the evolution of complexity through trans- 
fer of information present in the environment into the 
genome, and its relation to the second law of thermody- 
namics. Furthermore, we were able to conjecture that 
living systems typically evolve towards a self-organized 
critical state and that this state determines fractal pop- 
ulation structures. This also constitutes strong evidence 
for a punctuated equilibrium picture of evolution as 
favoured by the fossil record. 
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